User interface language: English | Español

HL Paper 2

Charlotte decides to model the shape of a cupcake to calculate its volume.

From rotating a photograph of her cupcake she estimates that its cross-section passes through the points (0, 3.5), (4, 6), (6.5, 4), (7, 3) and (7.5, 0), where all units are in centimetres. The cross-section is symmetrical in the x-axis, as shown below:

She models the section from (0, 3.5) to (4, 6) as a straight line.

Charlotte models the section of the cupcake that passes through the points (4, 6), (6.5, 4), (7, 3) and (7.5, 0) with a quadratic curve.

Charlotte thinks that a quadratic with a maximum point at (4, 6) and that passes through the point (7.5, 0) would be a better fit.

Believing this to be a better model for her cupcake, Charlotte finds the volume of revolution about the x-axis to estimate the volume of the cupcake.

Find the equation of the line passing through these two points.

[2]
a.

Find the equation of the least squares regression quadratic curve for these four points.

[2]
b.i.

By considering the gradient of this curve when x=4, explain why it may not be a good model.

[1]
b.ii.

Find the equation of the new model.

[4]
c.

Write down an expression for her estimate of the volume as a sum of two integrals.

[4]
d.i.

Find the value of Charlotte’s estimate.

[1]
d.ii.



It is known that the weights of male Persian cats are normally distributed with mean 6.1kg and variance 0.52kg2.

A group of 80 male Persian cats are drawn from this population.

The male cats are now joined by 80 female Persian cats. The female cats are drawn from a population whose weights are normally distributed with mean 4.5kg and standard deviation 0.45kg.

Ten female cats are chosen at random.

Sketch a diagram showing the above information.

[2]
a.

Find the proportion of male Persian cats weighing between 5.5kg and 6.5kg.

[2]
b.

Determine the expected number of cats in this group that have a weight of less than 5.3kg.

[3]
c.

Find the probability that exactly one of them weighs over 4.62kg.

[4]
d.i.

Let N be the number of cats weighing over 4.62kg.

Find the variance of N.

[1]
d.ii.

A cat is selected at random from all 160 cats.

Find the probability that the cat was female, given that its weight was over 4.7kg.

[4]
e.



A city has two cable companies, X and Y. Each year 20 % of the customers using company X move to company Y and 10 % of the customers using company Y move to company X. All additional losses and gains of customers by the companies may be ignored.

Initially company X and company Y both have 1200 customers.

Write down a transition matrix T representing the movements between the two companies in a particular year.

[2]
a.

Find the eigenvalues and corresponding eigenvectors of T.

[4]
b.

Hence write down matrices P and D such that T = PDP−1.

[2]
c.

Find an expression for the number of customers company X has after n years, where n N .

[5]
d.

Hence write down the number of customers that company X can expect to have in the long term.

[1]
e.



In a small village there are two doctors’ clinics, one owned by Doctor Black and the other owned by Doctor Green. It was noted after each year that 3.5% of Doctor Black’s patients moved to Doctor Green’s clinic and 5% of Doctor Green’s patients moved to Doctor Black’s clinic. All additional losses and gains of patients by the clinics may be ignored.

At the start of a particular year, it was noted that Doctor Black had 2100 patients on their register, compared to Doctor Green’s 3500 patients.

Write down a transition matrix T indicating the annual population movement between clinics.

[2]
a.

Find a prediction for the ratio of the number of patients Doctor Black will have, compared to Doctor Green, after two years.

[2]
b.

Find a matrix P, with integer elements, such that T=PDP1, where D is a diagonal matrix.

[6]
c.

Hence, show that the long-term transition matrix T is given by T=10171017717717.

[6]
d.

Hence, or otherwise, determine the expected ratio of the number of patients Doctor Black would have compared to Doctor Green in the long term.

[2]
e.



The masses in kilograms of melons produced by a farm can be modelled by a normal distribution with a mean of 2.6 kg and a standard deviation of 0.5 kg.

Find the probability that two melons picked at random and independently of each other will

One year due to favourable weather conditions it is thought that the mean mass of the melons has increased.

The owner of the farm decides to take a random sample of 16 melons to test this hypothesis at the 5% significance level, assuming the standard deviation of the masses of the melons has not changed.

Unknown to the farmer the favourable weather conditions have led to all the melons having 10% greater mass than the model described above.

Find the probability that a melon selected at random will have a mass greater than 3.0kg.

[2]
a.

both have a mass greater than 3.0kg.

[2]
b.i.

have a total mass greater than 6.0kg.

[2]
b.ii.

Write down the null and alternative hypotheses for the test.

[1]
c.

Find the critical region for this test.

[4]
d.

Find the mean and standard deviation of the mass of the melons for this year.

[3]
e.

Find the probability of a Type II error in the owner’s test.

[2]
f.



The random variable X is thought to follow a binomial distribution B (4, p ). In order to investigate this belief, a random sample of 100 observations on X was taken with the following results.

An automatic machine is used to fill bottles of water. The amount delivered, Y  ml, may be assumed to be normally distributed with mean μ  ml and standard deviation 8 ml. Initially, the machine is adjusted so that the value of μ is 500. In order to check that the value of μ remains equal to 500, a random sample of 10 bottles is selected at regular intervals, and the mean amount of water, y ¯ , in these bottles is calculated. The following hypotheses are set up.

H0: μ  = 500;  H1: μ  ≠ 500

The critical region is defined to be  ( y ¯ < 495 ) ( y ¯ > 505 ) .

State suitable hypotheses for testing this belief.

[1]
a.i.

Calculate the mean of these data and hence estimate the value of p .

[5]
a.ii.

Calculate an appropriate value of χ 2  and state your conclusion, using a 1% significance level.

[13]
a.iii.

Find the significance level of this procedure.

[5]
b.i.

Some time later, the actual value of μ  is 503. Find the probability of a Type II error.

[3]
b.ii.



A student investigating the relationship between chemical reactions and temperature finds the Arrhenius equation on the internet.

k=Ae-cT

This equation links a variable k with the temperature T, where A and c are positive constants and T>0.

The Arrhenius equation predicts that the graph of lnk against 1T is a straight line.

Write down

The following data are found for a particular reaction, where T is measured in Kelvin and k is measured in cm3mol1s1:

Find an estimate of

Show that dkdT is always positive.

[3]
a.

Given that limTk=A and limT0k=0, sketch the graph of k against T.

[3]
b.

(i)   the gradient of this line in terms of c;

(ii)  the y-intercept of this line in terms of A.

[4]
c.

Find the equation of the regression line for lnk on 1T.

[2]
d.

c.

It is not required to state units for this value.

[1]
e.i.

A.

It is not required to state units for this value.

[2]
e.ii.



Willow finds that she receives approximately 70 emails per working day.

She decides to model the number of emails received per working day using the random variable X , where X follows a Poisson distribution with mean 70.

In order to test her model, Willow records the number of emails she receives per working day over a period of 6 months. The results are shown in the following table.

From the table, calculate

Archie works for a different company and knows that he receives emails according to a Poisson distribution, with a mean of λ emails per day.

Using this distribution model, find  P ( X < 60 ) .

[2]
a.i.

Using this distribution model, find the standard deviation of X .

[2]
a.ii.

an estimate for the mean number of emails received per working day.

[3]
b.i.

an estimate for the standard deviation of the number of emails received per working day.

[2]
b.ii.

Give one piece of evidence that suggests Willow’s Poisson distribution model is not a good fit.

[1]
c.

Suppose that the probability of Archie receiving more than 10 emails in total on any one day is 0.99. Find the value of λ.

[3]
d.

Now suppose that Archie received exactly 20 emails in total in a consecutive two day period. Show that the probability that he received exactly 10 of them on the first day is independent of λ.

[5]
e.



Long term experience shows that if it is sunny on a particular day in Vokram, then the probability that it will be sunny the following day is 0.8. If it is not sunny, then the probability that it will be sunny the following day is 0.3.

The transition matrix T is used to model this information, where T=0.8 0.30.2 0.7.

The matrix T can be written as a product of three matrices, PDP-1 , where D is a diagonal matrix.

It is sunny today. Find the probability that it will be sunny in three days’ time.

[2]
a.

Find the eigenvalues and eigenvectors of T.

[5]
b.

Write down the matrix P.

[1]
c.i.

Write down the matrix D.

[1]
c.ii.

Hence find the long-term percentage of sunny days in Vokram.

[4]
d.



A continuous random variable X has probability density function f given by

f ( x ) = { x 2 a + b , 0 x 4 0 otherwise where  a  and  b  are positive constants.

It is given that P ( X 2 ) = 0.75 .

Eight independent observations of X are now taken and the random variable Y is the number of observations such that X 2 .

Show that a = 32 and b = 1 12 .

[5]
a.

Find E ( X ) .

[2]
b.

Find Var ( X ) .

[2]
c.

Find the median of X .

[3]
d.

Find E ( Y ) .

[2]
e.

Find P ( Y 3 ) .

[1]
f.



Arianne plays a game of darts.

The distance that her darts land from the centre, O, of the board can be modelled by a normal distribution with mean 10cm and standard deviation 3cm.

Find the probability that

Each of Arianne’s throws is independent of her previous throws.

In a competition a player has three darts to throw on each turn. A point is scored if a player throws all three darts to land within a central area around O. When Arianne throws a dart the probability that it lands within this area is 0.8143.

In the competition Arianne has ten turns, each with three darts.

a dart lands less than 13cm from O.

[2]
a.i.

a dart lands more than 15cm from O.

[1]
a.ii.

Find the probability that Arianne throws two consecutive darts that land more than 15cm from O.

[2]
b.

Find the probability that Arianne does not score a point on a turn of three darts.

[2]
c.

Find Arianne’s expected score in the competition.

[4]
d.i.

Find the probability that Arianne scores at least 5 points in the competition.

[1]
d.ii.

Find the probability that Arianne scores at least 5 points and less than 8 points.

[2]
d.iii.

Given that Arianne scores at least 5 points, find the probability that Arianne scores less than 8 points.

[2]
d.iv.



A Chocolate Shop advertises free gifts to customers that collect three vouchers. The vouchers are placed at random into 10% of all chocolate bars sold at this shop. Kati buys some of these bars and she opens them one at a time to see if they contain a voucher. Let P ( X = n ) be the probability that Kati obtains her third voucher on the n th  bar opened.

(It is assumed that the probability that a chocolate bar contains a voucher stays at 10% throughout the question.)

It is given that P ( X = n ) = n 2 + a n + b 2000 × 0.9 n 3 for n 3 ,   n N .

Kati’s mother goes to the shop and buys x  chocolate bars. She takes the bars home for Kati to open.

Show that P ( X = 3 ) = 0.001 and P ( X = 4 ) = 0.0027 .

[3]
a.

Find the values of the constants a and b .

[5]
b.

Deduce that P ( X = n ) P ( X = n 1 ) = 0.9 ( n 1 ) n 3 for n > 3 .

[4]
c.

(i)     Hence show that X has two modes m 1 and m 2 .

(ii)     State the values of m 1 and m 2 .

[5]
d.

Determine the minimum value of x such that the probability Kati receives at least one free gift is greater than 0.5.

[3]
e.



The weights, in grams, of individual packets of coffee can be modelled by a normal distribution, with mean 102g and standard deviation 8g.

Find the probability that a randomly selected packet has a weight less than 100g.

[2]
a.

The probability that a randomly selected packet has a weight greater than w grams is 0.444. Find the value of w.

[2]
b.

A packet is randomly selected. Given that the packet has a weight greater than 105g, find the probability that it has a weight greater than 110g.

[3]
c.

From a random sample of 500 packets, determine the number of packets that would be expected to have a weight lying within 1.5 standard deviations of the mean.

[3]
d.

Packets are delivered to supermarkets in batches of 80. Determine the probability that at least 20 packets from a randomly selected batch have a weight less than 95g.

[4]
e.



Hank sets up a bird table in his garden to provide the local birds with some food. Hank notices that a specific bird, a large magpie, visits several times per month and he names him Bill. Hank models the number of times per month that Bill visits his garden as a Poisson distribution with mean 3.1.

Over the course of 3 consecutive months, find the probability that Bill visits the garden:

Using Hank’s model, find the probability that Bill visits the garden on exactly four occasions during one particular month.

[1]
a.

on exactly 12 occasions.

[2]
b.i.

during the first and third month only.

[3]
b.ii.

Find the probability that over a 12-month period, there will be exactly 3 months when Bill does not visit the garden.

[4]
c.

After the first year, a number of baby magpies start to visit Hank’s garden. It may be assumed that each of these baby magpies visits the garden randomly and independently, and that the number of times each baby magpie visits the garden per month is modelled by a Poisson distribution with mean 2.1.

Determine the least number of magpies required, including Bill, in order that the probability of Hank’s garden having at least 30 magpie visits per month is greater than 0.2.

[4]
d.



It is given that one in five cups of coffee contain more than 120 mg of caffeine.
It is also known that three in five cups contain more than 110 mg of caffeine.

Assume that the caffeine content of coffee is modelled by a normal distribution.
Find the mean and standard deviation of the caffeine content of coffee.




John likes to go sailing every day in July. To help him make a decision on whether it is safe to go sailing he classifies each day in July as windy or calm. Given that a day in July is calm, the probability that the next day is calm is 0.9. Given that a day in July is windy, the probability that the next day is calm is 0.3. The weather forecast for the 1st July predicts that the probability that it will be calm is 0.8.

Draw a tree diagram to represent this information for the first three days of July.

[3]
a.

Find the probability that the 3rd July is calm.

[2]
b.

Find the probability that the 1st July was calm given that the 3rd July is windy.

[4]
c.



The curve  y = f ( x )  is shown in the graph, for  0 x 10 .

The curve  y = f ( x )  passes through the following points.

It is required to find the area bounded by the curve, the x -axis, the y -axis and the line x = 10 .

One possible model for the curve  y = f ( x )  is a cubic function.

Use the trapezoidal rule to find an estimate for the area.

[3]
a.

Use all the coordinates in the table to find the equation of the least squares cubic regression curve.

[3]
b.i.

Write down the coefficient of determination.

[1]
b.ii.

Write down an expression for the area enclosed by the cubic regression curve, the x -axis, the y -axis and the line x = 10 .

[1]
c.i.

Find the value of this area.

[2]
c.ii.



A café serves sandwiches and cakes. Each customer will choose one of the following three options; buy only a sandwich, buy only a cake or buy both a sandwich and a cake.

The probability that a customer buys a sandwich is 0.72 and the probability that a customer buys a cake is 0.45.

Find the probability that a customer chosen at random will buy

On a typical day 200 customers come to the café.

It is known that 46 % of the customers who come to the café are male, and that 80 % of these buy a sandwich.

both a sandwich and a cake.

[3]
a.i.

only a sandwich.

[1]
a.ii.

Find the expected number of cakes sold on a typical day.

[1]
b.i.

Find the probability that more than 100 cakes will be sold on a typical day.

[3]
b.ii.

A customer is selected at random. Find the probability that the customer is male and buys a sandwich.

[1]
c.i.

A female customer is selected at random. Find the probability that she buys a sandwich.

[4]
c.ii.



Dana has collected some data regarding the heights h (metres) of waves against a pier at 50 randomly chosen times in a single day. This data is shown in the table below.

She wishes to perform a χ2-test at the 5% significance level to see if the height of waves could be modelled by a normal distribution. Her null hypothesis is

H0: The data can be modelled by a normal distribution.

From the table she calculates the mean of the heights in her sample to be 0.828m and the standard deviation of the heights sn to be 0.257m.

She calculates the expected values for each interval under this null hypothesis, and some of these values are shown in the table below.

Use the given value of sn to find the value of sn-1.

[2]
a.

Find the value of a and the value of b, giving your answers correct to one decimal place.

[3]
b.

Find the value of the χ2 test statistic χcalc2 for this test.

[2]
c.

Determine the degrees of freedom for Dana’s test.

[2]
d.

It is given that the critical value for this test is 9.49.

State the conclusion of the test in context. Use your answer to part (c) to justify your conclusion.

[2]
e.



The times taken for male runners to complete a marathon can be modelled by a normal distribution with a mean 196 minutes and a standard deviation 24 minutes.

It is found that 5% of the male runners complete the marathon in less than T 1 minutes.

The times taken for female runners to complete the marathon can be modelled by a normal distribution with a mean 210 minutes. It is found that 58% of female runners complete the marathon between 185 and 235 minutes.

Find the probability that a runner selected at random will complete the marathon in less than 3 hours.

[2]
a.

Calculate T 1 .

[2]
b.

Find the standard deviation of the times taken by female runners.

[4]
c.



A Principal would like to compare the students in his school with a national standard. He decides to give a test to eight students made up of four boys and four girls. One of the teachers offers to find the volunteers from his class.

The marks out of 40, for the students who took the test, are:

25,   29,   38,   37,   12,   18,   27,   31.

For the eight students find

The national standard mark is 25.2 out of 40.

Two additional students take the test at a later date and the mean mark for all ten students is 28.1 and the standard deviation is 8.4.

For further analysis, a standardized score out of 100 for the ten students is obtained by multiplying the scores by 2 and adding 20.

For the ten students, find

Name the type of sampling that best describes the method used by the Principal.

[1]
a.

the mean mark.

[2]
b.i.

the standard deviation of the marks.

[1]
b.ii.

Perform an appropriate test at the 5% significance level to see if the mean marks achieved by the students in the school are higher than the national standard. It can be assumed that the marks come from a normal population.

[5]
c.

State one reason why the test might not be valid.

[1]
d.

their mean standardized score.

[1]
e.i.

the standard deviation of their standardized score.

[2]
e.ii.



The number of marathons that Audrey runs in any given year can be modelled by a Poisson distribution with mean 1.3 .

Calculate the probability that Audrey will run at least two marathons in a particular year.

[2]
a.

Find the probability that she will run at least two marathons in exactly four out of the following five years.

[4]
b.



Packets of biscuits are produced by a machine. The weights X , in grams, of packets of biscuits can be modelled by a normal distribution where X N ( μ ,   σ 2 ) . A packet of biscuits is considered to be underweight if it weighs less than 250 grams.

The manufacturer makes the decision that the probability that a packet is underweight should be 0.002. To do this μ is increased and σ remains unchanged.

The manufacturer is happy with the decision that the probability that a packet is underweight should be 0.002, but is unhappy with the way in which this was achieved. The machine is now adjusted to reduce σ and return μ to 253.

Given that μ = 253 and σ = 1.5 find the probability that a randomly chosen packet of biscuits is underweight.

[2]
a.

Calculate the new value of μ giving your answer correct to two decimal places.

[3]
b.

Calculate the new value of σ .

[2]
c.



In a reforested area of pine trees, heights of trees planted in a specific year seem to follow a normal distribution. A sample of 100 such trees is selected to test the validity of this hypothesis. The results of measuring tree heights, to the nearest centimetre, are recorded in the first two columns of the table below.

Describe what is meant by

a goodness of fit test (a complete explanation required);

[2]
a.i.

the level of significance of a hypothesis test.

[1]
a.ii.

Find the mean and standard deviation of the sample data in the table above. Show how you arrived at your answers.

[4]
b.

Most of the expected frequencies have been calculated in the third column. (Frequencies have been rounded to the nearest integer, and frequencies in the first and last classes have been extended to include the rest of the data beyond 15 and 225. Find the values of a , b  and c and show how you arrived at your answers.

[4]
c.

In order to test for the goodness of fit, the test statistic was calculated to be 1.0847. Show how this was done.

[3]
d.

State your hypotheses, critical number, decision rule and conclusion (using a 5% level of significance).

[5]
e.



The random variable X has a normal distribution with mean μ = 50 and variance σ 2 = 16 .

Sketch the probability density function for X, and shade the region representing P(μ − 2σ < X < μ + σ).

[2]
a.

Find the value of P(μ − 2σ < X < μ + σ).

[2]
b.

Find the value of k for which P(μkσ < X < μ + kσ) = 0.5.

[2]
c.



Loreto is a manager at the Da Vinci health centre. If the mean rate of patients arriving at the health centre exceeds 1.5 per minute then Loreto will employ extra staff. It is assumed that the number of patients arriving in any given time period follows a Poisson distribution.

Loreto performs a hypothesis test to determine whether she should employ extra staff. She finds that 320 patients arrived during a randomly selected 3-hour clinic.

Loreto is also concerned about the average waiting time for patients to see a nurse. The health centre aims for at least 95% of patients to see a nurse in under 20 minutes.

Loreto assumes that the waiting times for patients are independent of each other and decides to perform a hypothesis test at a 10% significance level to determine whether the health centre is meeting its target.

Loreto surveys 150 patients and finds that 11 of them waited more than 20 minutes.

Write down null and alternative hypotheses for Loreto’s test.

[2]
a.i.

Using the data from Loreto’s sample, perform the hypothesis test at a 5% significance level to determine if Loreto should employ extra staff.

[5]
a.ii.

Write down null and alternative hypotheses for this test.

[2]
b.i.

Perform the test, clearly stating the conclusion in context.

[5]
b.ii.



The marks achieved by eight students in a class test are given in the following list.

The teacher increases all the marks by 2. Write down the new value for

the standard deviation.




A random variable X is normally distributed with mean μ  and standard deviation σ , such that P ( X < 30.31 ) = 0.1180 and P ( X > 42.52 ) = 0.3060 .

Find μ and σ .

[6]
a.

Find P ( | X μ | < 1.2 σ ) .

[2]
b.



A geneticist uses a Markov chain model to investigate changes in a specific gene in a cell as it divides. Every time the cell divides, the gene may mutate between its normal state and other states.

The model is of the form

Xn+1Zn+1=MXnZn

where Xn is the probability of the gene being in its normal state after dividing for the nth time, and Zn is the probability of it being in another state after dividing for the nth time, where n.

Matrix M is found to be 0.94  b0.06  0.98.

The gene is in its normal state when n=0. Calculate the probability of it being in its normal state

Write down the value of b.

[1]
a.i.

What does b represent in this context?

[1]
a.ii.

Find the eigenvalues of M.

[3]
b.

Find the eigenvectors of M.

[3]
c.

when n=5.

[2]
d.i.

in the long term.

[2]
d.ii.



Scientists have developed a type of corn whose protein quality may help chickens gain weight faster than the present type used. To test this new type, 20 one-day-old chicks were fed a ration that contained the new corn while another control group of 20 chicks was fed the ordinary corn. The data below gives the weight gains in grams, for each group after three weeks.

The scientists wish to investigate the claim that Group B gain weight faster than Group A. Test this claim at the 5% level of significance, noting which hypothesis test you are using. You may assume that the weight gain for each group is normally distributed, with the same variance, and independent from each other.

[6]
a.

The data from the two samples above are combined to form a single set of data. The following frequency table gives the observed frequencies for the combined sample. The data has been divided into five intervals.

Test, at the 5% level, whether the combined data can be considered to be a sample from a normal population with a mean of 380.

[10]
b.



The random variable X has a binomial distribution with parameters n and p.
It is given that E(X) = 3.5.

Find the least possible value of n.

[2]
a.

It is further given that P(X ≤ 1) = 0.09478 correct to 4 significant figures.

Determine the value of n and the value of p.

[5]
b.



A survey of British holidaymakers found that 15% of those surveyed took a holiday in the Lake District in 2019.

A random sample of 16 British holidaymakers was taken. The number of people in the sample who took a holiday in the Lake District in 2019 can be modelled by a binomial distribution.

State two assumptions made in order for this model to be valid.

[2]
a.i.

Find the probability that at least three people from the sample took a holiday in the Lake District in 2019.

[2]
a.ii.

From a random sample of n holidaymakers, the probability that at least one of them took a holiday in the Lake District in 2019 is greater than 0.999.

Determine the least possible value of n.

[3]
b.



A discrete random variable X follows a Poisson distribution Po ( μ ) .

Show that P ( X = x + 1 ) = μ x + 1 × P ( X = x ) ,   x N .

[3]
a.

Given that P ( X = 2 ) = 0.241667  and P ( X = 3 ) = 0.112777 , use part (a) to find the value of μ .

[3]
b.



Jenna is a keen book reader. The number of books she reads during one week can be modelled by a Poisson distribution with mean 2.6.

Determine the expected number of weeks in one year, of 52 weeks, during which Jenna reads at least four books.




It is known that 56 % of Infiglow batteries have a life of less than 16 hours, and 94 % have a life less than 17 hours. It can be assumed that battery life is modelled by the normal distribution N ( μ , σ 2 ) .

Find the value of μ and the value of σ .

[6]
a.

Find the probability that a randomly selected Infiglow battery will have a life of at least 15 hours.

[2]
b.



The continuous random variable X has probability density function  f given by

f ( x ) = { 3 a x , 0 x < 0.5 a ( 2 x ) , 0.5 x < 2 0 , otherwise

 

Show that  a = 2 3 .

[3]
a.

Find  P ( X < 1 ) .

[3]
b.

Given that P ( s < X < 0.8 ) = 2 × P ( 2 s < X < 0.8 ) , and that 0.25 < s < 0.4 , find the value of s.

[7]
c.



Steffi the stray cat often visits Will’s house in search of food. Let X be the discrete random variable “the number of times per day that Steffi visits Will’s house”.

The random variable X can be modelled by a Poisson distribution with mean 2.1.

Let Y be the discrete random variable “the number of times per day that Steffi is fed at Will’s house”. Steffi is only fed on the first four occasions that she visits each day.

Find the probability that on a randomly selected day, Steffi does not visit Will’s house.

[2]
a.

Copy and complete the probability distribution table for Y.

[4]
b.

Hence find the expected number of times per day that Steffi is fed at Will’s house.

[3]
c.

In any given year of 365 days, the probability that Steffi does not visit Will for at most n days in total is 0.5 (to one decimal place). Find the value of n .

[3]
d.

Show that the expected number of occasions per year on which Steffi visits Will’s house and is not fed is at least 30.

[4]
e.



Timmy owns a shop. His daily income from selling his goods can be modelled as a normal distribution, with a mean daily income of $820, and a standard deviation of $230. To make a profit, Timmy’s daily income needs to be greater than $1000.

Calculate the probability that, on a randomly selected day, Timmy makes a profit.

[2]
a.

The shop is open for 24 days every month.

Calculate the probability that, in a randomly selected month, Timmy makes a profit on between 5 and 10 days (inclusive).

[3]
b.



The number of bananas that Lucca eats during any particular day follows a Poisson distribution with mean 0.2.

Find the probability that Lucca eats at least one banana in a particular day.

[2]
a.

Find the expected number of weeks in the year in which Lucca eats no bananas.

[4]
b.



Events A and B are such that P ( A B ) = 0.95 ,  P ( A B ) = 0.6 and P ( A | B ) = 0.75 .

Find  P ( B ) .

[2]
a.

Find P ( A ) .

[2]
b.

Hence show that events A and B are independent.

[2]
c.



A horse breeder records the number of births for each of 100 horses during the past eight years. The results are summarized in the following table:

Stating null and alternative hypotheses carry out an appropriate test at the 5% significance level to decide whether the results can be modelled by B (6, 0.5).

[10]
a.

Without doing any further calculations, explain briefly how you would carry out a test, at the 5% significance level, to decide if the data can be modelled by B(6, p ), where p  is unspecified.

[2]
b.

A different horse breeder collected data on the time and outcome of births. The data are summarized in the following table:

Carry out an appropriate test at the 5% significance level to decide whether there is an association between time and outcome.

[8]
c.



Each of the 25 students in a class are asked how many pets they own. Two students own three pets and no students own more than three pets. The mean and standard deviation of the number of pets owned by students in the class are  18 25 and  24 25  respectively.

Find the number of students in the class who do not own a pet.




Iqbal attempts three practice papers in mathematics. The probability that he passes the first paper is 0.6. Whenever he gains a pass in a paper, his confidence increases so that the probability of him passing the next paper increases by 0.1. Whenever he fails a paper the probability of him passing the next paper is 0.6.

Complete the given probability tree diagram for Iqbal’s three attempts, labelling each branch with the correct probability.

[3]
a.

Calculate the probability that Iqbal passes at least two of the papers he attempts.

[2]
b.

Find the probability that Iqbal passes his third paper, given that he passed only one previous paper.

[3]
c.



There are 75 players in a golf club who take part in a golf tournament. The scores obtained on the 18th hole are as shown in the following table.

M17/5/MATHL/HP2/ENG/TZ2/01

One of the players is chosen at random. Find the probability that this player’s score was 5 or more.

[2]
a.

Calculate the mean score.

[2]
b.



Jorge is carefully observing the rise in sales of a new app he has created.

The number of sales in the first four months is shown in the table below.

Jorge believes that the increase is exponential and proposes to model the number of sales N in month t with the equation

N=Aert, A,r

Jorge plans to adapt Euler’s method to find an approximate value for r.

With a step length of one month the solution to the differential equation can be approximated using Euler’s method where

Nn+1Nn+1×N'n, n

Jorge decides to take the mean of these values as the approximation of r for his model. He also decides the graph of the model should pass through the point (2, 52).

The sum of the square residuals for these points for the least squares regression model is approximately 6.555.

Show that Jorge’s model satisfies the differential equation

dNdt=rN

[2]
a.

Show that rNn+1-NnNn

[3]
b.

Hence find three approximations for the value of r.

[3]
c.

Find the equation for Jorge’s model.

[3]
d.

Find the sum of the square residuals for Jorge’s model using the values t=1, 2, 3, 4.

[2]
e.

Comment how well Jorge’s model fits the data.

[1]
f.i.

Give two possible sources of error in the construction of his model.

[2]
f.ii.



A random variable X has a probability distribution given in the following table.

N16/5/MATHL/HP2/ENG/TZ0/01

Determine the value of E ( X 2 ) .

[2]
a.

Find the value of Var ( X ) .

[3]
b.



When carpet is manufactured, small faults occur at random. The number of faults in Premium carpets can be modelled by a Poisson distribution with mean 0.5 faults per 20 m2. Mr Jones chooses Premium carpets to replace the carpets in his office building. The office building has 10 rooms, each with the area of 80 m2.

Find the probability that the carpet laid in the first room has fewer than three faults.

[3]
a.

Find the probability that exactly seven rooms will have fewer than three faults in the carpet.

[3]
b.



The hens on a farm lay either white or brown eggs. The eggs are put into boxes of six. The farmer claims that the number of brown eggs in a box can be modelled by the binomial distribution, B(6, p ). By inspecting the contents of 150 boxes of eggs she obtains the following data.

Show that this data leads to an estimated value of p = 0.4 .

[1]
a.

Stating null and alternative hypotheses, carry out an appropriate test at the 5 % level to decide whether the farmer’s claim can be justified.

[11]
b.



The number of taxis arriving at Cardiff Central railway station can be modelled by a Poisson distribution. During busy periods of the day, taxis arrive at a mean rate of 5.3 taxis every 10 minutes. Let T represent a random 10 minute busy period.

Find the probability that exactly 4 taxis arrive during T.

[2]
a.i.

Find the most likely number of taxis that would arrive during T.

[2]
a.ii.

Given that more than 5 taxis arrive during T, find the probability that exactly 7 taxis arrive during T.

[3]
a.iii.

During quiet periods of the day, taxis arrive at a mean rate of 1.3 taxis every 10 minutes.

Find the probability that during a period of 15 minutes, of which the first 10 minutes is busy and the next 5 minutes is quiet, that exactly 2 taxis arrive.

[6]
b.



The age, L, in years, of a wolf can be modelled by the normal distribution L ~ N(8, 5).

Find the probability that a wolf selected at random is at least 5 years old.

[2]
a.

Eight wolves are independently selected at random and their ages recorded.

Find the probability that more than six of these wolves are at least 5 years old.

[3]
b.



Consider two events A and B such that P ( A ) = k ,  P ( B ) = 3 k ,  P ( A B ) = k 2 and P ( A B ) = 0.5 .

Calculate k ;

[3]
a.

Find P ( A B ) .

[3]
b.



The mean number of squirrels in a certain area is known to be 3.2 squirrels per hectare of woodland. Within this area, there is a 56 hectare woodland nature reserve. It is known that there are currently at least 168 squirrels in this reserve.

Assuming the population of squirrels follow a Poisson distribution, calculate the probability that there are more than 190 squirrels in the reserve.